
Handwriting Text Input System 

This is a continutation of 08/330,573 filed on 10/28/94. 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a text input system to lower the burden 
of text input for an information processing and communication control system. 

As one of the text input method, a conventional system utilizes an 
abbreviation or shorthand words. For example, it has a dictionary with plural 
number of words which are compared with the data input, and the word data 
corresponding to the data input is fetched from the dictionary and no more 
input is necessary. 

In case of the above-mentioned method, it is necessary to remember 
the abbreviation and activate the system function to find a desired word by 
depressing a corresponding key, and its efficiency depends on an operator's 
capability, as the operator has to remember all the characters of the 
abbreviation or shorthand words in the dictionary which correspond to those of 
data input for the successful functioning to find a match. 

2. Description of the Prior Art 

The conventional method has a common way to accept a leading part 
of word data or a radical of Chinese character or Japanese kanji character from 
a keyboard or a handwriting input apparatus. After the operator depresses an 
enter key or a space key, the data input is compared with the contents of 
dictionary to find the word which equates the data input and display, if any. 
The operator may find the desired word on the display and select one of them 
by the depression of a specific function key. 

However, the method to press the enter key or space key as a separator 
after typing some characters to find the desired word is not suitable for typing 
on the keyboard in a blind (eyes-free) manner. And it is also unlikely that the 
operator reiterates the steps to input a line of text, character by character, for 
the system to collate with the contents of dictionary and display words which 
have the same leading part as the data input, until some of them is selected by 
the operator. Its method heavily relies on the operator's ability and does not 
necessarily contribute to the increase of the input efficiency. 

In case of the handwriting input, there would be plenty of words in the 
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dictionary with the same radical as those of Chinese characters or Japanese 
kanji characters which will cause a frequent change of a list of words for the 
selection on the display after some data input and the repeated steps to select 
the desired words among them. Such action may actually decrease the 
efficiency of data input, e.g. the system first displays the list of radicals and 
the operator selects the radical to which the system gets back to the operator 
with the relevant characters for the selection on the display by the operator. 

SUMMARY OF THE INVENTION 

The text input system of the invention, was intended to provide a way 
of data input from the keyboard or handwriting stroke input apparatus, giving 
an efficient and natural way of input for the operator without being conscious 
of it while it is in use and by the step equivalent to what they have been doing 
up until now. 

The text input system should not give an extra burden and an 
additional training to the operator, and should be able to increase an efficiency 
of the data input, using a natural way of input, as well. 

In comparison with the conventional method, the invention system 
collates a data input with the dictionary, character by character, at the time of 
each data input and replaces the data input on the display with the unique data 
from the dictionary after finding the unique one which includes data input, in a 
variety of form of data, and requires no more input. This is the way to really 
increase the efficiency of data input and lower the burden of operator on the 
typing. 

Additionally, this invention is a flexible and effective way to further 
increase the efficiency of input, by decreasing the number of words in the 
dictionary containing the words only with the number of characters more than 
the specific number, in the dictionary, by way of storing relevant words with 
the data to collate with the data input, in the dictionary. It would decrease the 
number of data input and shorten the time required to reach the character 
position which makes the word unique in the dictionary for the purpose of 
supplementing the remaining part of word from the dictionary. 

In the abbreviation input, the conventional method uses a dictionary 
consisting of an abbreviation and its original form of word, though the present 
invention is able to use a standard dictionary, as well as the abbreviation 
dictionary. 

The system does not require special abbreviation dictionary. It is able 



to utilize the standard dictionary and accept the abbreviation input which 
contains the first character and some other characters of the word data to input 
and compares those with the dictionary. Even in case of having the special 
abbreviation dictionary, the present invention does not require all the characters 
of the abbreviation to input. In this case, the operator does not have to 
remember the abbreviation and may input part of it, eg the first character 
followed by some other characters of the abbreviation and which is unique in 
the dictionary. 



Some examples for the number of character positions in a word to be 
unique in the dictionary in comparison with the total number of characters of 
the word, are given in Table 2. The examples of a line of text of characters or 
strokes are shown in table 1 . 

Also, there is a possibility to be more quick in finding the unique word 
in the dictionary, by decreasing the number of short words. For example 
"abolish" can be identified at the 4th character position (i.e. abol) in the 
selective dictionary of following table 2, though it needs 6 characters to be 
unique in the standard dictionary. 



- Table 1 - 

Word pattern element codes 
for handwriting strokes 



Original The number of position 
word to be unique in the dictionary 



02 01 12 11 02 
— I — I — 

02 11 25 12 12 00 30 
-I -I N 



2 
2 



00 30 12 
/ v - 

01 34 
I 3 

12 01 11 11 



A 
B 

E 



* In this example, "IE" or "5E" can be unique 
at the second pattern element code position. 
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Standard dictionary 


Word 


The number of 


(or 


character posi- 


line 


tion in a 


of 


word to be 


text) 


unique in the 




dictionary 



Selective dictioary 



The total 
number of 
char, of 
the word 



The number of 
character posi- 
tion in a 
word to be 
unique in the 
dictionary 



abandon 


A 


7 


abate 


4 


5 


abbot 


4 


c 


abdomen 




7 


abhor 


3 


c 
D 


abide 


4 


j 


ability 


4 


/ 


abject 


3 


6 


able 


4 


4 


abnormal 


3 


8 


aboard 


4 


6 


abolish 


6 


7 


abolition 


6 


9 


abominable 


4 


10 


abound 


5 


6 


about 


5 


5 


above 


4 


5 


abridge 


4 


7 


abroad 


4 


6 


abrupt 


4 


6 


absence 


6 


7 


absent 


6 


6 


absinthe 


4 


8 


absolute 


6 


8 


absorb 


5 


6 



abandon 3 

abdomen 3 

ability 3 

abnormal 3 

abolish 4 



about 4 

abridge 3 

absence 4 

absolute 5 
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absorbent 7 
abstain 5 



9 
7 



absorbent 5 
abstain 4 



(Total) 127 184 
(127 / 184 = 0.69 : 31% 
differences 



41 79 
(41 / 79 = 0.52 : 48% 
differences 
(79 is the number 
of total char.) 



BRIEF DESCRIPTION OF THE DRAWINGS 

By way of example, the system of the present invention is described in 
the flowcharts in Figure 2 to Figure 11, and Figure 12 to Figure 14 show 
examples of of the dictionary to use in the invention system. Figure 1 is a 
block diagram of the invention system. 

DETAILED DESCRIPTION OF THE DRAWINGS 

In the flowchart of Fig. 2 to Fig. 1 1, A001 is the first step to clear the 
contents of the counters, flags, and work area, at the beginning of the data 
input process. 

Next A002 accepts a data input, character by character, or stroke by 
stroke, from a keyboard or a handwriting input instrument. 

A003 checks if the input data is an END code, to show the end of 
transaction, and If yes, the step goes to the termination of proces. Otherwise 
the step proceeds to A003A. 

A003A tests if the NONE flag is on which means that there is no data 
including the data input in the dictionary. If YES, the step goes to S020 in 
which the ID check steps are performed to find the unique one with the last 
data same as the last input data in the remaining positions other than that was 
already collated (in Fig. 9), as the ID check steps are performed after the 
input of last data and prior to its dictionary search, utilizing a flag of NONE 
flag wh lc h is set on at A006B in Fig. 3. It means that the attempt to search at 
A006A in Fig. 3 using the previous input has failed and the NONE flag was 
set ON for the purpose to activate ID check steps to find the data for "leading 
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data and some other data to be unique in the dictionary." If None flag is off, 
the step proceeds to next A003B. 

A003B tests if the input data is a separator code. If it is not separator 
code, the process proceeds to A004 in which input data is displayed at the end 
of data input on the display. If it is the separator code at the testing of A003B 
the step goes to A003C in which the input buffer is cleared and the step goes 
back to A002 for the next data input. 

after displaying the input data at A004, the next A005 adds the data 
input to the end of previous data string in the input buffer memory. Then, the 
step goes to A006 that is the subprogram of dictionary search. 

A006 performs a dictionary search to get the data which has the same 
leading part as the data input and which is unique in the dictionary, as one 
example in the case, because there are some other cases to find the data which 
is unique, despite the different form of data, e.g. the data input of the first data 
followed by some other data for the line of text data to represent the unique 
one in the dictionary and so on. 

Next A006A tests if the result of dictionary search at A006 shows that 
there are data to retrieve in this example. If there is no data, the step goes to 
A006B m which the NONE flag is turned ON and goes back to A002 for the 
next data input. In this example, the NONE flag is used for the purpose to do a 
sequence of "input of following data and ID check" to find the unique data 
with the same last data as the last data input or with the same last data input in 
the remaining parts other than that was already collated. If the test result at 
A006A is not negative, which means that there are still data to retrieve step 
A007 tests if the step of A006 found the unique one in the dictionary by the 
;'flag for the unique ON." If OFF, the step goes back to A002 for the next data 
input to continue the input and dictionary search. 

At A008 branching from A007, the system replaces input data on the 
display with the unique data from dictionary, the contents of input buffer is 
cleared for the next data input, and goes to next A008A. 

At A008A is the selection step for the case having plural number of 
relevant words in the dictionary after finding the unique data in the dictionary, 
m a manner that one of relevant data is selected on the display in this case' 
because the other selection steps are performed by ID check at S020 in Fig. 9 
testing the existence of data which is unique and terminates with the same one 
as the last data input or which is unique and includes the same one as the last 
data input in the remaining part that was already collated. Then the step goes 
back to A002 for the next data input. 
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SOOl is an entry of the subprogram to search in the dictionary. S002 is 
to test if there is no more word to retrieve in the dictionary. If so, the process 
goes to the exit of this subprogram. If there are words to retrieve at the testing 
of S002, the process goes to the S003. 

5003 gets the data located in the middle of area between an upper limit 
and a lower limit in the dictionary. In this case, the upper limit means the 
boundary with the text string which is getting larger, and the lower limit means 
the boundary which is getting smaller. 

5004 compares the data input with the data from the dictionary, and 
tests if the leading part of word from the dictionary is greater or not. 

If the greater flag is ON at S004, S005 divides the retrieval area size 
into 2 to use the lower half area for the next retrieval, as there is a possibility 
to find the unique word in the lower half area, because the dictionary data 
found at S003 was greater than the data input. Then, the step goes back to 
S002. If the greater than flag is OFF at S004, the step proceeds to S006. 

At S006, it is tested if the leading part of word from the dictionary is 
equal to that of the data input. If equal, S007 is performed to test if it is single. 
In case of a single one, the next step S008 turns on the flag for the unique data 
in the dictionary and goes to exit. If there are plural number of data at S007 
the step goes to S012. If NOT EQUAL at S006, the step proceeds to S009. 

At S009, testing is made as to whether the leading part of word from 
the dictionary is less than that of the data input. 

If the less than flag is ON after testing at S009, S010 divides the 
retrieval area size into 2 to use the upper half area for the next retrieval as 
there is a possibility to find the unique word in the upper half area because the 
dictionary data found at S003 was less than that of data input. Then the step 
goes to S002. ' 

If the less than flag is OFF at S009, there is something wrong with the 
procedure in the process and goes to the error procedure of SOU which is not 
described here, due to the matter which is not directly related to the invention 

Branching here from S007, S012 tries to find if the plural number of 
data have the same stem of word. If it is YES, the step goes to S012A in 
which the NONE flag is turned ON and the step goes back to A002 for the 
next data input. Othersie, the step goes to S012B. 

S012B gets the data with the leading part greater than that of input 
data in the upper half of current retrieval area in the dictionary. 

At the test of next SO 13 after performing the step of S012B, it is tested 
if the data was found, and goes to SO 15 if found. If the data was not found, the 
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step goes to SOI 4. 

S015 sets a new upper limit to retrieve using the positions of the data 
with the leading part of data greater than that of data input, in the dictionary 
and goes to next SO 16. 

SO 14 sets a new upper limit to retrieve in the dictionary using the 
position of the upper limit by which S012B was attempted, because the upper 
half 

area in this case was occupied by the words with the leading part equal to that 
of data input as a result of the attempt at S012B. Then, the step goes to S016 
SO 16 tries to find the word with the leading part less than that of data input in 
the lower half area of the dictionary. 

Next SO 17 tests if the data was found. 

S018 takes place if not found at S017 and sets a new lower limit using 
the position of the lower limit by which SO 16 was attempted, because the 
lower area was occupied by the words with the leading part equal to that of the 
data input as a result of the attempt at S016. Then, the step goes to the exit of 
this subprogram. 

SO 19 arises if the data was found at SO 17 and sets new lower limit to 
retrieve using the positions of the data with the leading part less than that of 
the data input and jumps to the exit of this sub-program. 

Branching from A003A when NONE flag is ON, S020 performs the 
steps named ID check in this case to start with S023 which is an entry of 
subprogram. 

S023 and followings check if there is the unique data which has the 
last data same as the last data input, or which includes the same one as the last 
data input m the remaining part other than that was already collated with the 

data input. 

Next S021 checks the flag for the unique data, after coming back from 
ID check. If its flag is OFF, the step goes back to A002 for the next data input. 
If the flag is ON, the step proceeds to next S022 which turns the NONE flag 
OFF and supplements the remaining part of data input on the display ie the 
replacement of the data input with the unique data found. Then, the step goes 
back to A002 for the next data input. 

5023 is the start of a subprogram which performs determination 
process of the unique data. S023 examines data which were selected by the 
prior look-up in the dictionary. 

5024 tests if there is the unique data which terminates with the same 
one as the last data input. If YES, the step goes to next S025 which turns the 
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FLAG for the unique data ON and goes to exit of this subprogram. If NO at 
the test of S024, the step goes to S026. 

At S026, it is tested if there is the unique data which includes the last 
data input in the remaining positions other than that was already collated with 
data input. If YES, the step goes to S025. If NO, the step proceeds to the next 
S027 in which the system extracts data which do not have the same one as the 
last data input or which do not include the same one as the last data input in 
the remaining part other than that was already collated with the data input, for 
the next retrieval. 

As a subset of the dictionary search for the data input consisting of a 
first character and some other characters, T001 which equivalent to the 
afore-mentioned S001, is an entry of this subprogram. 

T002 which is equivalent to the afore-mentioned S002 checks if there 
is still more word to retrieve in the dictionary. If there is no more word, the 
step goes to the exit. The process advances to the next T002A, if there are 
words to retrieve. 

T002A branches to the aforementioned S003 if the input is the first 
one of the data input, and goes to the next T002B if it was not the first one. 

T002B is to check if all words between upper and lower limit in the 
dictionary have the same leading part as that of the data input. 

T002C selects the words with the leading part which is not the same as 
that of the data input. 

T002D shifts one character to the left for every words selected at 
T002C and extract those words which terminated after making a character 
shift. 

T002E sets new upper and lower limit to retrieve in the dictionary after 
extracting words terminated at T002D. Then, the step goes to the 
aforementioned S003. 

Fig. 12 is an example for the word pattern element data for 
handwriting characters. In this example, assuming that "IE " is going to be 
input, at the time of input of its first stroke ■•- #3 and #4 words exist in this 
dictionary. Next stroke of "| ■■ selectes #3 only which corresponds to the input 
of "-| " in this case, and " IE " is replaced with the data input on the display. 

Fig. 13 and 14 are examples of the dictionary containing a line of text, 
original word, and relevant words. 



- 9 - 



